Logistic Regression for Crystal Growth Process Modeling through Hierarchical Nonnegative Garrote based Variable Selection
Authors
Hongyue Sun, Xinwei Deng, Kaibo Wang, and Ran Jin
Grado Department of Industrial and Systems Engineering, Virginia Tech, Blacksburg, VA 24061, USA
Department of Statistics, Virginia Tech, Blacksburg, VA 24061, USA
Department of Industrial Engineering, Tsinghua University, Beijing 100084, China
Abstract
Single-crystal silicon ingots are produced by a complex crystal growth process. The process is sensitive to subtle changes in process conditions, which can easily cause it to fail and yield a polycrystalline ingot instead of the desired monocrystalline ingot. It is therefore important to model this polycrystalline defect in the crystal growth process and to identify the key process variables and their features. However, modeling the crystal growth process is challenging because of the complicated engineering mechanisms and the large number of functional process variables. In this paper, we focus on modeling the relationship between a binary quality indicator for the polycrystalline defect and the functional process variables. We propose a logistic regression model with a hierarchical nonnegative garrote based variable selection method, which can accurately estimate the model, identify key process variables, and capture important features. Simulations and a case study are conducted to illustrate the merits of the proposed method in prediction and variable selection.
[Supplemental materials are available for this article. Go to the publisher’s online edition of IIE Transactions for the supplemental materials.]
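The abstract does not spell out the estimator, so the following is only a minimal sketch of a plain (non-hierarchical) nonnegative garrote applied to logistic regression, not the authors' hierarchical method. It assumes the usual two-stage form: fit initial coefficients, then estimate nonnegative shrinkage factors c_j that rescale them under an L1-type penalty lam * sum(c_j). The function names, the synthetic data, the step sizes, and the value of lam are all hypothetical choices made for illustration.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def fit_logistic(X, y, lr=0.5, steps=2000, ridge=1e-3):
    """Stage 1 (assumed): lightly ridge-stabilized logistic fit by gradient descent."""
    n, p = len(X), len(X[0])
    b0, beta = 0.0, [0.0] * p
    for _ in range(steps):
        # Residuals p_i - y_i under the current coefficients.
        resid = [sigmoid(b0 + sum(bj * xj for bj, xj in zip(beta, xi))) - yi
                 for xi, yi in zip(X, y)]
        grad = [sum(r * xi[j] for r, xi in zip(resid, X)) / n + ridge * beta[j]
                for j in range(p)]
        b0 -= lr * sum(resid) / n
        beta = [bj - lr * gj for bj, gj in zip(beta, grad)]
    return b0, beta


def garrote_logistic(X, y, b0_init, beta_init, lam, lr=0.5, steps=2000):
    """Stage 2: shrinkage factors c_j >= 0 by projected gradient descent.

    Minimizes the mean negative log-likelihood of the model whose j-th
    coefficient is c_j * beta_init[j], plus the penalty lam * sum(c_j).
    """
    n, p = len(X), len(X[0])
    b0, c = b0_init, [1.0] * p
    for _ in range(steps):
        resid = [sigmoid(b0 + sum(cj * bj * xj
                                  for cj, bj, xj in zip(c, beta_init, xi))) - yi
                 for xi, yi in zip(X, y)]
        b0 -= lr * sum(resid) / n
        for j in range(p):
            grad = beta_init[j] * sum(r * xi[j] for r, xi in zip(resid, X)) / n
            # Gradient step on loss + penalty, then projection onto c_j >= 0.
            c[j] = max(0.0, c[j] - lr * (grad + lam))
    return b0, c


def make_data(n=200, seed=0):
    # Synthetic example: two informative predictors and one pure-noise predictor.
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        xi = [rng.gauss(0.0, 1.0) for _ in range(3)]
        prob = sigmoid(2.0 * xi[0] - 1.5 * xi[1])
        X.append(xi)
        y.append(1 if rng.random() < prob else 0)
    return X, y


X, y = make_data()
b0, beta = fit_logistic(X, y)
_, c = garrote_logistic(X, y, b0, beta, lam=0.05)
selected = [j for j, cj in enumerate(c) if cj > 0.0]
print("shrinkage factors:", c)
print("selected predictors:", selected)
```

In this sketch a predictor counts as selected when its shrinkage factor stays above zero. Increasing `lam` pushes more factors to exactly zero, and in practice it would be tuned, for example by cross-validation.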
Similar resources
Logistic Regression with the Nonnegative Garrote
Logistic regression is one of the most commonly applied statistical methods for binary classification problems. This paper considers the nonnegative garrote regularization penalty in logistic models and derives an optimization algorithm for minimizing the resultant penalty function. The search algorithm is computationally efficient and can be used even when the number of regressors is much larg...
On the Nonnegative Garrote Estimator
We study the nonnegative garrote estimator from three different aspects: computation, consistency and flexibility. We show that the nonnegative garrote estimate has a piecewise linear solution path. Using this fact, we propose an efficient algorithm for computing the whole solution path for the nonnegative garrote estimate. We also show that the nonnegative garrote has the nice property that wi...
Robust nonnegative garrote variable selection in linear regression
Robust selection of variables in a linear regression model is investigated. Many variable selection methods are available, but very few methods are designed to avoid sensitivity to vertical outliers as well as to leverage points. The nonnegative garrote method is a powerful variable selection method, developed originally for linear regression but recently successfully extended to more complex reg...
Structured Variable Selection and Estimation
In linear regression problems with related predictors, it is desirable to do variable selection and estimation by maintaining the hierarchical or structural relationships among predictors. In this paper, we propose nonnegative garrote methods that can naturally incorporate such relationships defined through effect heredity principles or marginality principles. We show that the methods are very ...
Variable Selection for Sparse High-Dimensional Nonlinear Regression Models by Combining Nonnegative Garrote and Sure Independence Screening.
In many regression problems, the relations between the covariates and the response may be nonlinear. Motivated by the application of reconstructing a gene regulatory network, we consider a sparse high-dimensional additive model with the additive components being some known nonlinear functions with unknown parameters. To identify the subset of important covariates, we propose a new method for si...